Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range

Authors

  • Xiang Wan
  • Wenqian Wang
  • Jiming Liu
  • Tiejun Tong
Abstract

BACKGROUND In systematic reviews and meta-analysis, researchers often pool the sample means and standard deviations from a set of similar clinical trials. A number of trials, however, report their results using the median, the minimum and maximum values, and/or the first and third quartiles. Hence, in order to combine results, one may have to estimate the sample mean and standard deviation for such trials.

METHODS In this paper, we propose to improve the existing literature in several directions. First, we show that the sample standard deviation estimation in Hozo et al.'s method (BMC Med Res Methodol 5:13, 2005) has some serious limitations and is always less satisfactory in practice. Inspired by this, we propose a new estimation method that incorporates the sample size. Second, we systematically study the sample mean and standard deviation estimation problem under several other interesting settings where the interquartile range is also available for the trials.

RESULTS We demonstrate the performance of the proposed methods through simulation studies for each of the three frequently encountered scenarios. For the first two scenarios, our method greatly improves on existing methods and provides a nearly unbiased estimate of the true sample standard deviation for normal data and a slightly biased estimate for skewed data. For the third scenario, our method still performs very well for both normal data and skewed data. Furthermore, we compare the estimators of the sample mean and standard deviation under all three scenarios and present some suggestions on which scenario is preferred in real-world applications.

CONCLUSIONS In this paper, we discuss different approximation methods for estimating the sample mean and standard deviation and propose some new estimation methods to improve the existing literature. We conclude our work with a summary table (an Excel spreadsheet including all formulas) that serves as comprehensive guidance for performing meta-analysis in different situations.
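The abstract does not state the estimators themselves. As a minimal sketch, the three scenarios can be coded as below, using the formulas as they are commonly cited from the published paper; treat them as an assumption and check them against the paper's summary table before use. The function names are hypothetical, `a` and `b` denote the minimum and maximum, `m` the median, `q1` and `q3` the quartiles, and `n` the sample size.

```python
from statistics import NormalDist

# Standard normal quantile function (inverse CDF), from the standard library.
_z = NormalDist().inv_cdf


def _check_n(n):
    # For n < 2 the quantile arguments reach 0.5, which gives a zero divisor.
    if n < 2:
        raise ValueError("sample size n must be at least 2")


def estimate_from_range(a, m, b, n):
    """Scenario 1 (assumed formulas): minimum, median, maximum, size."""
    _check_n(n)
    mean = (a + 2 * m + b) / 4
    sd = (b - a) / (2 * _z((n - 0.375) / (n + 0.25)))
    return mean, sd


def estimate_from_range_and_iqr(a, q1, m, q3, b, n):
    """Scenario 2 (assumed formulas): range and quartiles both reported."""
    _check_n(n)
    mean = (a + 2 * q1 + 2 * m + 2 * q3 + b) / 8
    sd = ((b - a) / (4 * _z((n - 0.375) / (n + 0.25)))
          + (q3 - q1) / (4 * _z((0.75 * n - 0.125) / (n + 0.25))))
    return mean, sd


def estimate_from_iqr(q1, m, q3, n):
    """Scenario 3 (assumed formulas): quartiles and median only."""
    _check_n(n)
    mean = (q1 + m + q3) / 3
    sd = (q3 - q1) / (2 * _z((0.75 * n - 0.125) / (n + 0.25)))
    return mean, sd
```

For example, `estimate_from_range(0, 5, 10, 50)` returns a mean of 5.0 and a standard deviation that shrinks as `n` grows for a fixed range, which is the dependence on sample size that the abstract describes.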


Similar resources

Estimating the sample mean and standard deviation from the sample size, median, range and/or interquartile range

In systematic reviews and meta-analysis, researchers often pool the results of the sample mean and standard deviation from a set of similar clinical trials. A number of the trials, however, reported the study using the median, the minimum and maximum values, and/or the first and third quartiles. Hence, in order to combine results, one may have to estimate the sample mean and standard deviation ...


Introduction to biostatistics: Part 2, Descriptive statistics.

Descriptive statistics include measures of central tendency and variability. Measures of central tendency include mean, median, and mode. The mean is the arithmetic average of data from interval or ratio scales. The median reflects the 50th percentile score. The mode is the most frequently occurring value of a data distribution. Measures of variability include range, interquartile range, standa...


Estimating the mean and variance from the median, range, and the size of a sample

BACKGROUND Usually the researchers performing meta-analysis of continuous outcomes from clinical trials need their mean value and the variance (or standard deviation) in order to pool data. However, sometimes the published reports of clinical trials only report the median, range and the size of the trial. METHODS In this article we use simple and elementary inequalities and approximations in ...
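The teaser above is truncated, so the rules are not given on this page. The sketch below follows the estimators as they are usually attributed to that article; this is an assumption, including the sample-size thresholds of 15 and 70, and the function name is hypothetical. Here `a` is the minimum, `m` the median, `b` the maximum and `n` the sample size.

```python
import math


def hozo_estimate(a, m, b, n):
    """Estimate (mean, sd) from minimum, median, maximum and sample size.

    Assumed rules: a weighted mean of the three summary values, and a
    standard deviation that switches formula at n = 15 and n = 70.
    """
    if n < 1:
        raise ValueError("sample size n must be positive")
    mean = (a + 2 * m + b) / 4
    if n <= 15:
        # Small samples: inequality-based variance formula.
        sd = math.sqrt(((a - 2 * m + b) ** 2 / 4 + (b - a) ** 2) / 12)
    elif n <= 70:
        # Moderate samples: range / 4.
        sd = (b - a) / 4
    else:
        # Large samples: range / 6.
        sd = (b - a) / 6
    return mean, sd
```

Under these assumed rules the standard deviation estimate is a step function of `n`: with the same minimum, median and maximum, a trial of 70 patients and one of 71 get noticeably different estimates.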


Microsoft Word - p107_vTypesetted_v2.docx

Characteristics of a population are often unknown. To estimate such characteristics, random sampling must be used. Sampling is the process by which a subgroup of a population is examined in order to infer the values of the population's true characteristics. Estimates based on samples are approximations of the population's true value; therefore, it is often useful to know the reliability of such...


Statistics on microcomputers: a non-algebraic guide to the appropriate use of statistical packages in biomedical research and pathology laboratory practice. 6. Statistical methods for diagnostic tests.

Reference ranges: Most tests used in clinical medicine give a numerical result on a continuous measurement scale. Pathologists or clinicians attempt to interpret the result by comparing it with a "reference range" previously calculated from a study of people who do not have the disease in question. By current convention, the reference range includes all but the top and bottom 2.5% of the results...



Journal title:

Volume 14, Issue

Pages -

Publication date: 2014